Search results for "Expression data"

showing 10 items of 11 documents

Detection of Vasodilators From Herbal Components by a Transcriptome-Based Functional Gene Module Reference Approach

2019

Vasodilatation is one of the key therapeutic strategies for the treatment of various cardiovascular diseases with high blood pressure. Therefore, development of drugs assisting blood vessel dilation is promising. It has been proved that many drugs display definite vasorelaxant effects. However, there are very few studies that systemically explore the effective vasodilators. In this work, we build a transcriptome-based functional gene module reference approach for systematic pursuit of agents with vasorelaxant effects. We firstly curate two functional gene modules that specifically involved in positive and negative regulation of vascular diameter based on the known gene functional interactio…

0301 basic medicinePharmacologyherbal componentDrug discoverylcsh:RM1-950Functional genesVasodilationComputational biologyBiologygene moduledrug discoveryTranscriptome03 medical and health sciencesGene expression database030104 developmental biology0302 clinical medicinelcsh:Therapeutics. PharmacologyGene Modules030220 oncology & carcinogenesisGene expressiongene expression profilePharmacology (medical)GenevasodilatorOriginal ResearchFrontiers in Pharmacology
researchProduct

An Extension of the DgLARS Method to High-Dimensional Relative Risk Regression Models

2020

In recent years, clinical studies, where patients are routinely screened for many genomic features, are becoming more common. The general aim of such studies is to find genomic signatures useful for treatment decisions and the development of new treatments. However, genomic data are typically noisy and high dimensional, not rarely outstripping the number of patients included in the study. For this reason, sparse estimators are usually used in the study of high-dimensional survival data. In this paper, we propose an extension of the differential geometric least angle regression method to high-dimensional relative risk regression models.

Clustering high-dimensional dataComputer sciencedgLARS Gene expression data High-dimensional data Relative risk regression models Sparsity · Survival analysisLeast-angle regressionRelative riskStatisticsEstimatorRegression analysisExtension (predicate logic)High dimensionalSettore SECS-S/01 - StatisticaSurvival analysis
researchProduct

GenClust: A genetic algorithm for clustering gene expression data

2005

Abstract Background Clustering is a key step in the analysis of gene expression data, and in fact, many classical clustering algorithms are used, or more innovative ones have been designed and validated for the task. Despite the widespread use of artificial intelligence techniques in bioinformatics and, more generally, data analysis, there are very few clustering algorithms based on the genetic paradigm, yet that paradigm has great potential in finding good heuristic solutions to a difficult optimization problem such as clustering. Results GenClust is a new genetic algorithm for clustering gene expression data. It has two key features: (a) a novel coding of the search space that is simple, …

Clustering high-dimensional dataDNA ComplementaryComputer scienceRand indexCorrelation clusteringOligonucleotidesEvolutionary algorithmlcsh:Computer applications to medicine. Medical informaticscomputer.software_genreBiochemistryPattern Recognition AutomatedBiclusteringOpen Reading FramesStructural BiologyCURE data clustering algorithmConsensus clusteringGenetic algorithmCluster AnalysisCluster analysislcsh:QH301-705.5Molecular BiologyGene expression data Clustering Evolutionary algorithmsOligonucleotide Array Sequence AnalysisModels StatisticalBrown clusteringHeuristicGene Expression ProfilingApplied MathematicsComputational BiologyComputer Science Applicationslcsh:Biology (General)Gene Expression RegulationMutationlcsh:R858-859.7Data miningSequence AlignmentcomputerSoftwareAlgorithmsBMC Bioinformatics
researchProduct

Regularized Regression Incorporating Network Information: Simultaneous Estimation of Covariate Coefficients and Connection Signs

2014

We develop an algorithm that incorporates network information into regression settings. It simultaneously estimates the covariate coefficients and the signs of the network connections (i.e. whether the connections are of an activating or of a repressing type). For the coefficient estimation steps an additional penalty is set on top of the lasso penalty, similarly to Li and Li (2008). We develop a fast implementation for the new method based on coordinate descent. Furthermore, we show how the new methods can be applied to time-to-event data. The new method yields good results in simulation studies concerning sensitivity and specificity of non-zero covariate coefficients, estimation of networ…

Clustering high-dimensional databusiness.industryjel:C41jel:C13Machine learningcomputer.software_genreRegressionhigh-dimensional data gene expression data pathway information penalized regressionConnection (mathematics)Set (abstract data type)Lasso (statistics)CovariateArtificial intelligenceSensitivity (control systems)businessCoordinate descentAlgorithmcomputerMathematics
researchProduct

Pathway network inference from gene expression data

2014

[EN] Background: The development of high-throughput omics technologies enabled genome-wide measurements of the activity of cellular elements and provides the analytical resources for the progress of the Systems Biology discipline. Analysis and interpretation of gene expression data has evolved from the gene to the pathway and interaction level, i.e. from the detection of differentially expressed genes, to the establishment of gene interaction networks and the identification of enriched functional categories. Still, the understanding of biological systems requires a further level of analysis that addresses the characterization of the interaction between functional modules. Results: We presen…

ESTADISTICA E INVESTIGACION OPERATIVAGene regulatory networkGene ExpressionInferenceSister chromatidsOxidative Phosphorylation//purl.org/becyt/ford/1 [https]Structural BiologyEstadística e Investigación OperativaGene Regulatory NetworksTopology (chemistry)Alzheimers-DiseaseGeneticsDIBUJOBiological systemsApplied MathematicsSystems BiologyCell Cycle//purl.org/becyt/ford/1.2 [https]Computer Science ApplicationsMicroarray experimentsModeling and SimulationIdentification (biology)Functional assessmentDNA-replicationFunctional connectionsGlycolysisCIENCIAS NATURALES Y EXACTASPathway NetworkDNA ReplicationSaccharomyces-CervisiaeBioinformaticsS-phaseSystems biologyGenomicsComputational biologySaccharomyces cerevisiaeBiologyGene interactionAlzheimer DiseaseModelling and SimulationGenomic dataPANAPathwaysMolecular BiologyUbiquitinResearchGene Expression ProfilingR packageGluconeogenesisGene expression profilingComputingMethodologies_PATTERNRECOGNITIONPurinesCiencias de la Computación e InformaciónProteolysisGene expression dataCiencias de la Información y BioinformáticaUbiquitin conjugationPathwayBMC Systems Biology
researchProduct

Sector identification in a set of stock return time series traded at the London Stock Exchange

2005

We compare some methods recently used in the literature to detect the existence of a certain degree of common behavior of stock returns belonging to the same economic sector. Specifically, we discuss methods based on random matrix theory and hierarchical clustering techniques. We apply these methods to a portfolio of stocks traded at the London Stock Exchange. The investigated time series are recorded both at a daily time horizon and at a 5-minute time horizon. The correlation coefficient matrix is very different at different time horizons confirming that more structured correlation coefficient matrices are observed for long time horizons. All the considered methods are able to detect econo…

FOS: Economics and businessPhysics - Physics and SocietyStatistical Finance (q-fin.ST)SYSTEMSEXPRESSION DATAQuantitative Finance - Statistical FinanceFOS: Physical sciencesFINANCIAL-MARKETSDisordered Systems and Neural Networks (cond-mat.dis-nn)Physics and Society (physics.soc-ph)Condensed Matter - Disordered Systems and Neural NetworksMATRICESNOISE
researchProduct

Ranking Series of Cancer-Related Gene Expression Data by Means of the Superposing Significant Interaction Rules Method

2020

The Superposing Significant Interaction Rules (SSIR) method is a combinatorial procedure that deals with symbolic descriptors of samples. It is able to rank the series of samples when those items are classified into two classes. The method selects preferential descriptors and, with them, generates rules that make up the rank by means of a simple voting procedure. Here, two application examples are provided. In both cases, binary or multilevel strings encoding gene expressions are considered as descriptors. It is shown how the SSIR procedure is useful for ranking the series of patient transcription data to diagnose two types of cancer (leukemia and prostate cancer) obtaining Area Under Recei…

Male0301 basic medicineKey genesComputer sciencelcsh:QR1-502Binary numberBiochemistrylcsh:MicrobiologyArticlePattern Recognition AutomatedStructure-Activity Relationship03 medical and health sciencesBig data0302 clinical medicinerankingData MiningHumanscancergene expressionsRelated geneCàncerMolecular BiologyOligonucleotide Array Sequence AnalysisCancerPròstata -- CàncerLeukemiaReceiver operating characteristicbusiness.industryGene Expression ProfilingleukemiaProstatic NeoplasmsLeucèmiaDades massivesPattern recognitionprostate cancerExpressió gènicaSSIR method030104 developmental biologyROC Curvemultilevel fingerprintsExpression dataData Interpretation Statistical030220 oncology & carcinogenesisProstate -- CancerArtificial intelligenceGene expressionbusinessAlgorithms
researchProduct

Kullback-Leibler distance as a measure of the information filtered from multivariate data

2007

We show that the Kullback-Leibler distance is a good measure of the statistical uncertainty of correlation matrices estimated by using a finite set of data. For correlation matrices of multivariate Gaussian variables we analytically determine the expected values of the Kullback-Leibler distance of a sample correlation matrix from a reference model and we show that the expected values are known also when the specific model is unknown. We propose to make use of the Kullback-Leibler distance to estimate the information extracted from a correlation matrix by correlation filtering procedures. We also show how to use this distance to measure the stability of filtering procedures with respect to s…

Physics - Physics and SocietyKullback–Leibler divergenceStatistical Finance (q-fin.ST)Covariance matrixEXPRESSION DATAFOS: Physical sciencesQuantitative Finance - Statistical FinanceMultivariate normal distributionPhysics and Society (physics.soc-ph)Measure (mathematics)Stability (probability)Hierarchical clusteringDistance correlationFOS: Economics and businessPhysics - Data Analysis Statistics and ProbabilityStatisticsTime seriesAlgorithmData Analysis Statistics and Probability (physics.data-an)MATRICESMathematics
researchProduct

Discriminating Graph Pattern Miningfrom Gene Expression Data

2016

We consider the problem of mining gene expression data in order to single out interesting features characterizing healthy/unhealthy samples of an input dataset. We present an approach based on a network model of the input gene expression data, where there is a labelled graph for each sample. To the best of our knowledge, this is the first attempt to build a different graph for each sample and, then, to have a database of graphs for representing a sample set. Our main goal is that of singling out interesting differences between healthy and unhealthy samples, through the extraction of "discriminative patterns" among graphs belonging to the two different sample sets. Differently from the other…

Settore INF/01 - InformaticaBiological networkPattern miningGene expression dataSoftware
researchProduct

Sparse relative risk regression models

2020

Summary Clinical studies where patients are routinely screened for many genomic features are becoming more routine. In principle, this holds the promise of being able to find genomic signatures for a particular disease. In particular, cancer survival is thought to be closely linked to the genomic constitution of the tumor. Discovering such signatures will be useful in the diagnosis of the patient, may be used for treatment decisions and, perhaps, even the development of new treatments. However, genomic data are typically noisy and high-dimensional, not rarely outstripping the number of patients included in the study. Regularized survival models have been proposed to deal with such scenarios…

Statistics and ProbabilityClustering high-dimensional dataComputer sciencedgLARSInferenceScale (descriptive set theory)BiostatisticsMachine learningcomputer.software_genreRisk Assessment01 natural sciencesRegularization (mathematics)Relative risk regression model010104 statistics & probability03 medical and health sciencesNeoplasmsCovariateHumansComputer Simulation0101 mathematicsOnline Only ArticlesSurvival analysis030304 developmental biology0303 health sciencesModels Statisticalbusiness.industryLeast-angle regressionRegression analysisGeneral MedicineSurvival AnalysisHigh-dimensional dataGene expression dataRegression AnalysisArtificial intelligenceStatistics Probability and UncertaintySettore SECS-S/01 - StatisticabusinessSparsitycomputerBiostatistics
researchProduct